Stable and Low-Distortion Algorithm Based on Overdetermined Blind Separation for Convolutive Mixtures of Speech

Authors

Tsuyoki Nishikawa

Hiroshi Saruwatari

Kiyohiro Shikano

Atsunobu Kaminuma

Abstract

We propose a new algorithm with stable learning and low distortion, based on overdetermined blind separation, for convolutive mixtures of speech. To improve separation performance, we have proposed multistage ICA, in which frequency-domain ICA and time-domain ICA (TDICA) are cascaded. For temporally correlated signals, we must use TDICA with a nonholonomic constraint to avoid the decorrelation effect. However, stability cannot be guaranteed in the nonholonomic case. Also, in the holonomic case, the sound quality of the separated signal is distorted by the decorrelation effect. To solve the stability problem, we perform TDICA with the holonomic constraint. To avoid the distortions, we estimate the distortion components by TDICA with the holonomic constraint and compensate the sound quality using the estimated components. The stability of the proposed algorithm is guaranteed by the holonomic constraint, and the proposed compensation prevents the distortion. Experiments in a reverberant room reveal that the algorithm achieves higher stability and higher separation performance.

1 Introduction

Blind source separation (BSS) is an approach for estimating original source signals using only the information in the mixed signals observed at each input channel. This technique is applicable to high-quality hands-free speech recognition systems. Many BSS methods based on independent component analysis (ICA) [1] have been proposed [2,3] for acoustic signal separation. However, the performance of these methods degrades seriously, especially under heavily reverberant conditions. In order to improve the separation performance, we have proposed multistage ICA (MSICA) involving subarray processing [4], in which frequency-domain ICA (FDICA) [3,5] and time-domain ICA (TDICA) [2,6] are cascaded (see Fig. 1). In this method, first, we divide the observed signals in a microphone array into the observed signals in the subarrays.
In every subarray,
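The convolutive, overdetermined mixing that the abstract refers to can be illustrated with a minimal sketch. It is not the authors' algorithm and performs no separation; it only simulates the observation model in which each microphone receives a sum of filtered sources, with more microphones than sources. The function name `convolutive_mix`, the array shapes, the random decaying filters, and the Laplacian stand-in for speech are all assumptions made for illustration.

```python
import numpy as np


def convolutive_mix(sources, filters):
    """Simulate x_j(t) = sum_i sum_tau a_ji(tau) * s_i(t - tau).

    sources: array of shape (n_src, n_samples)
    filters: array of shape (n_mic, n_src, n_taps), hypothetical room responses
    returns: array of shape (n_mic, n_samples)
    """
    n_mic, n_src, _ = filters.shape
    n_samples = sources.shape[1]
    mixed = np.zeros((n_mic, n_samples))
    for j in range(n_mic):
        for i in range(n_src):
            # Full convolution, truncated to the original signal length.
            mixed[j] += np.convolve(sources[i], filters[j, i])[:n_samples]
    return mixed


rng = np.random.default_rng(0)

# Two super-Gaussian sources as a rough stand-in for speech (assumption).
sources = rng.laplace(size=(2, 1000))

# Four microphones and two sources: the overdetermined case.
# Exponentially decaying random taps mimic a short reverberant response.
filters = rng.normal(size=(4, 2, 8)) * np.exp(-0.5 * np.arange(8))

observed = convolutive_mix(sources, filters)
print(observed.shape)
```

With a single filter tap the model collapses to instantaneous mixing, `filters[:, :, 0] @ sources`, which is the case ordinary ICA handles; longer filters are what make reverberant separation harder.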

Similar resources

Convolutive independent component analysis by leave-one-out optimal kernel approximation

This work addresses blind separation of convolutive mixtures of independent sources. The temporally convolutive structure is assumed to be composed of multiple mixing matrices, each corresponding to a time delay, collectively transforming a segment of consecutive source signals to form multichannel observations. When τ = 1, this problem reduces to linear independent component analysis. For arb...

Blind Source Separation of Convolutive Audio Using an Adaptive Stereo Basis

We consider the problem of convolutive blind source separation of audio mixtures. We propose an Adaptive Stereo Basis (ASB) method based on learning a set of basis vector pairs from the time-domain stereo mixtures. The basis vector pairs are clustered using estimated directions of arrival (DOAs) such that each basis vector pair is associated with one source. The ASB method is compared with the...

Blind Separation of Convolved Sources Based on Information Maximization

Blind separation of independent sources from their convolutive mixtures is a problem in many real world multi-sensor applications. In this paper we present a solution to this problem based on the information maximization principle, which was recently proposed by Bell and Sejnowski for the case of blind separation of instantaneous mixtures. We present a feedback network architecture capable of c...

Subband-Based Blind Separation for Convolutive Mixtures of Speech

We propose utilizing subband-based blind source separation (BSS) for convolutive mixtures of speech. This is motivated by the drawback of frequency-domain BSS, i.e., when a long frame with a fixed long frame-shift is used to cover reverberation, the number of samples in each frequency decreases and the separation performance is degraded. In subband BSS, (1) by using a moderate number of subband...

On-line Convolutive Blind Source Separation of Non-Stationary Signals

A novel algorithm is proposed in this paper to solve blind source separation of post-nonlinear convolutive mixtures of non-stationary sources. Both convolutive mixing and post-nonlinear distortion are included in the proposed model. Based on the generalized Expectation-Maximization (EM) algorithm, the Maximum Likelihood (ML) approach is developed to estimate the parameters in the model. A set o...

Publication date: 2004
